So yeah, we put this stuff on so that so that we have some record of the whole thing, what uh are going to say. And uh the whole idea here is says also put the lapel on. Uh the whole idea of this meeting is that it's not me who will be talking all the time. So so that would be a boring that would be definitely boring meeting. Yeah, yeah, yeah. So I so the p the actually the reason why we got together is that it's ab it's about the time to A_ make sure that when Petr leaves us uh I'm afraid it may happen, right. I mean it or did you didn't change your mind yet. Or your girlfriend didn't change your mind. So so that whatever he's whatever he's been doing uh is not uh forgotten and then people can stay on his shoulders and uh I hope that he won't get out entirely out of uh out of uh this problem. But still we we need to continue on this. So I think the Joel will be is is my main hope that he's uh he's going to pick up uh the key-word spotting and some of these ideas and also the reason why p uh I p uh uh Petr Motlicek is here is that Joel is also getting interested in uh OPAL models of uh tem temporal envelope. Yeah, and of of course Pet uh Petr for signals. So about it but Petr Motlicek too. Eh both of you you know. So but but but you will be here and from different point of view, eh. Petr Motlicek is working mainly on speech coding. Yeah, and uh so so this is um yeah, I guess this is it. So m I I guess that really you should be talking more than anybody else because you should be asking questions. Yeah. Yeah, I know. So what is it that you need to know. And uh Yeah. Oh this were ho how it started basically because that was uh Marios Athineos, yeah, and Dan Ellis. They had a paper, I believe it was not in ICASSP A_S_R_U_ or something, right. It was a A_S_R_U_ but that was a time when they weren't using very long uh uh uh d uh segments. It was like fifty millisecond or something what they were playing with at the time. Yeah, I know. So then i the story was that then uh Marios presented that at so some meeting and we realised oh, that might be cute thing uh for uh estimating uh spectral energies. Because that's what we want. And we'll the the way we've uh we are doing it is kind of I think it works, but it's little bit inelegant. somehow stupid, so so first do the frame by based uh estimate, and then you you get stick to hundred hertz sampling, ten millisecond sampling or uh something, and then you just pick up the trajectory. So so Marios was here for half a year maybe or even more uh, and uh started work with Petr who also j just came. So they were working little bit together, mainly M Marios was uh programming in MATLAB and Petr was putting it into fancy is yeah and uh but uh and but then we were hoping uh uh uh th to pick up on that little bit more, and we we did, right? You did a little bit of work. Yeah. Hmm. And we never published that anywhere, right? Huh. Yeah. Yeah.. Yeah. We were also doing a serious thing that we were stepping it by ten millisecond, right? At the time. Right. Yeah. Right. That was a reasonable start, yeah. Yeah. So you may be the only one in the whole world now. Because uh because Chi Fong is gone and uh and uh Barry is gone. So Morgan was joking no what he informed me they are gone from ICSI. Uh Chi Fong went to T_I_ and uh and uh uh uh B Barry is at Lawrence Livermore lab. He basically got a job. He he graduated. And he's he'll be working on things which he cannot talk about. He can't even say it on a microphone here I heard. It's uh i yeah, it's classified work typically. It's and so so jo Morgan was h only half joking. He was s somebody was asking him you were or so s or somebody was asking him something. Uh or there was uh there was uh Matthew and he said well, probably we'll be coming to ask you because nobody really knows anymore at ICSI. So please stay. Uh th nobody. That was Barry. And now it's uh cl close histo actually Brno is doing a a lot of that now. They picked up the software and they play with the and nets. So that might be that Brno will be at the soonest as uh a ultimate expert on on the whole thing. N n where which software? No, they ne Quicknet was done uh uh well Quicknet itself was done actually long time ago at ICSI. Is that so? Group? Okay. So we h have to treat him well. Well the whole idea then is that then you start playing with it, right? And you yeah, and Maybe that might be the On frame. Oh. He used to have before he lost all the data. But he will have it again.. Yeah. knows more than than than I do. S so that's good. But certainly when it comes to like uh the the OPAL modelling of temporal trajectories, even pe Petr Fousek is not here, you probably know a lot about it by now. No. That uh uh ap approximating Hilbert envelope by OPAL model. I a v a V_L_P_. Uh as as I say when you when you are not here, that that Petr Motlicek knows enough about it too, because same ap yeah, different uh application, different uh different software or something. If if you make a b broader bands. Because when the band is narrow, the c excitation is close the carrier is close to s sinusoid basically, co-sinusoid. You may almost want to dis uh um i b uh uh uh yeah, uh b use the white-board, I mean uh and uh that's why we have it here, right, so that I don't think so. I think no, we are wired. Except Pulses. This is L_P_. This is a standard time domain L_P_, right? But since you are in a different domain, this is he here we are in time domain, right. But but you are you are really in a frequency domain. So it uh the line's turning to into sines, right. And you p and since this this I mean because this thi this this spectrum is al has also line spectrum, right. I mean all components. So if you are then if we are in a frequency, th your source looks like this. And It's uh yeah, right. So you know that. And this is where And the biggest problem currently is that the carrier is actually not cosine but uh something little bit more uh uh interesting. And if the frequency doesn't change over the wa over over the lengths of the uh Yeah. Yeah. But but f f fortunately, because of he uh because of properties of hearing more or less, for the higher bands you if you put in just s some garbage, some noise, it doesn't really m w matter all that much. It w sounds little bit worse. But uh it's not a big tragedy. Uh where you really w if you want to g do a good voice speech, uh you'll want to get the lower bands correct. And if you put in uh noise altogether, it sounds like a whispered speech, uh but you understand perfect. By noise. So now we have to kill them both. Because I mean gave all the secrets away. I mean Qual computers is going to Yes. If Motorola or Nokia gets hold of this uh tape, then we are in trouble. Uh hopefully it by then it will be protected and and everything, right? It's it's uh it's but listen, this is a science. I mean it's uh confidential to the extent that we are not uh i uh publishing it yet. But we w we of of course we will. And uh and uh but uh we have to make sure that that uh if Qualcomm has a interest of who's funding the work the they have to have a chance to uh to to patent it or something. But that's about all. This is not a public uh public uh presentation. So this i is not even a problem. The only problem would be if but you know, nobody can beat p Petr uh Petr Motlicek anymore in I mean if even if if Nokia knows everything if you if you sell it tomorrow who did you work for th the uh uh text no, t H_P_ H_P_. They will take half a year before they would even realise what it is. And by then so it's not no problem. Okay. So that's where that's uh that's uh To get the uh When you comp uh w what you com when you compress the effectively they'll want the signal which we are modelling. And uh so then you get the fitting to compress signal. Essentially if you were in spectral domain you would compress the power spectrum. Here you compress the the th this th this i the what ful yeah of D_C_T_ of of the signal, right. And so the w then you have uh predictor is fitting actually this compress signal. But you know how much c it was compressed. So you can always expand it back so that you get the c that you get a fit to the to the original thing, except that if it is different. Because uh it ha th basically this OPAL modeling has a different properties. And uh it's So we can tell you a little bit more about it. This is an uh sort of old technique which uh it And it was my thesis even. P_H_D_ thesis actually. A small sorry? Yeah it was S_T_L_P_, was called spectral transform linear prediction. It was my first ICASSP paper ever. It was uh it nineteen eighty three Boston. You guys were still in kindergarten, right? Oh, this is terrible. So But no nobody nobody was using it for anything, except it was cute. But that was about it. And uh and but now it seems like that is since since what we are doing here we are normal L_P_C_ is fitting uh peaks, right. But when you are modeling Hilbert envelope, you don't necessarily want to fit only peaks. You sometimes want to get a basically good estimate of the of the envelope. So that helps you. Because in this uh thi thi this spectral transform allows you to to find the right balance between how well it fits peaks, how how well it fits dips and uh and yeah, and sort of the so this is g so I think that basically I don't know what else I mean you should be asking questions, right. So Yeah. SAPA. And then yeah this is a this they are two papers in uh one is in SAPA, one is in uh U_ Interspeech. uh in a c Cheju. And one was in the A_S_R_U_. It was from Virgin Islands. SAPA was a s I I tell you what it stands for, statistic and ad and something ad and uh perceptual approaches. I was on f I was on a f w s organising committee. I should know. But statistic and uh and something approaches. Uh pe perceptua and statistics and perceptual a approaches. That was uh But they are the same, right, as uh as uh and then also is there is some more work actually can be traced down. I think there wa uh the reading Les Atlas's papers are very good. Because Les Atlas is doing a lot of work on coding in this domain. He doesn't use OPAL model. But he uses D_C_T_ and then he p uh so the so so and it and Les is excellent guy. He's very good mathemat mm h very good theoretician. Basically he's a he's at Stanford the guy. And uh really he likes to write things well. So he's very rigorous uh as opposed to s to me for instance. So if you read that then it then you know that there are no errors and Yeah, uh but um so whatever you g whatever you get hands on from Les Atlas uh for over past five years uh regardi this two years ago, last year on one of the ICASSPs he had like five papers, right. Or various aspects of this because uh he's eager to Is that so? I didn't even see it. Yeah. Or Uh-huh. I don't To fill in, yeah, with all kinds of parameters. He has all kinds of speech recognition results, right. It was the first time you start using the cluster, I remember, or some uh some parallel processing. That was a big thing here. Everybody hated him for t uh temporarily. I remember. Uh-huh.. I yeah, I remember Frank came to me and says is everybody was computing like Marius, we would be all dead. But I say, Frank, but I want everybody to compute like Marius. And Well it is important, you know. Whereas I mean this is something which we never talk about, right. I hear it how much work uh does put in uh here. Because it's true it's true this that uh that things sped up tremendously since I came. I mean uh of course partially because they started to buy some more uh more hardware. But also be p some at least some people basically know how to uh uh write faster programs. Uh this is fantastic. And used to be two days, yeah.. Uh. So do you already know where uh what is the situation with you? You go back to Prague for a while. And then did you get anything in F in France? I got some ideas by the way. I yeah. Yeah. Yeah. Uh-huh. Oh it's that's the that's the would be a pity, right. You you ca you t so you don't die of hunger. And so yeah. Yeah, that's not the idea. No. We'll we'll uh we will we'll try something, let's see what we can do. Um So Joel, your main basically thing is as I told you to t uh when we were walking that uh that what whatever Petr did, that it doesn't die, but it i it carries. And obviously uh v we very much hope that you get into that and you get excited and you do s more and uh or uh maybe even different things. But uh That it actually works. It's not that it doesn't work. Uh Yeah. Then And i and this and this is all on the record. So if we uh if uh if we you can hold him to his word. J ju you rewind. Browse through the meeting. Okay. How we doing time-wise? Yeah I think half hour it's fine, right? But you will be here until almost until end of uh Yeah. So breaks my heart. It's just like I was just start teaching and then I n I'll need an assistant. Hmm. Yeah maybe. Uh yeah. Uh d hopefully it's make m makes uh pay pay some extra. So we'll see. Either speech. Uh but it's co actually compilation of uh really compilation of two classes I was giving at O_G_I_, which is like speech processing and and uh perception. And so I tried to put it together so that it makes some sense. No. No. I've it's uh i it I should have at least four. Because this is a p like uh graduates. But I have one, alright. Guaranteed. I have one guaranteed that I don't know if anybody else is uh Uh uh Sorry? I s I don't know. I don't know if it needs to be four credit or not. I don't know how much I like people without uh n not taking it for credit because then they just sit there and having good time. But it's okay. No no, it's okay. Yeah. Guillermo is taking, very good. And so I'll I_S_ who already have at least two. If uh Yeah, we pro I probably get four. And uh it's possible. That's for sure. I mean that's Okay. Mm-hmm. Mm-hmm. Yeah, the notes notebooks. Uh-huh.. But that's good, you know. It's uh I didn't get into too many things actually. We know the reason was that at the time we thought I was teaching with Herve. And we thought I we have much more time. And suddenly we ran out of time basically because what happened was that uh it was supposed to be three hour lecture. But it was only two hour lecture. And uh Herve assumed that it's going to run longer. Or I don't know what he assumed. But suddenly we sa s he said well so so where are you. I said well I'm just slowly getting into L_P_C_. He said why but I we have only three more lectures. I have to teach H_M_M_ and everything. So I the L_P_C_ part was like I had to do it in one hour. And then everybody was c totally confused I guess. And Yeah, they were also confused. Yeah. It was at D_B_F_L_, yeah. But Oh we'll see. Okay guys. I think that um this unless somebody has uh still anything incredibly important Right, it's uh ju it's a little bit up to you. You have to p you have to back this uh uh ha h specially Petr. Because just to make sure that and yeah. Yeah. Do you think that you will be done by May? Is it plan? I think so. Yeah, we made a mistake, I tell you,. I think if I had to do it again and we made a mistake we should have register you here at E_P_F_L_. You would have had a degree by now from E_P_F_L_. Yeah, that was a mistake. Sorry yours You were or you were here t you were you were here how long you were here? Two years? Yeah. Where are the Yeah. Right. So is she getting uh uh another position there? Or what is the re reason of What a boyfriend, huh. Oh I see. So that's good. Mm-hmm. Mm-hmm. Yeah. Mm-hmm. Mm-hmm. Yeah, I know. Yeah. Yeah. I know. Yeah. Um okay, Otherwise I am also much more available than I was. I mean when you came it was just like cra one of the craziest uh uh times of this year. Uh uh it's not it's not that I was very productive, that I was doing anything useful. But it just like it's a little bit like writing thesis, all the time re-writing and change the do make a little changes and uh uh and Yeah, he's ver no he has many more sp many more people so to take care of. Yeah. Well I'm typically I try to be, because uh because, you know, I mean for s this is also my livelihood, right, students. So I have to make sure that that that uh they need to know something that uh so feel free basically to stop by any time and um yeah and uh it's not that I know everything. But if I know I uh I think I sort of know a little bit like what you guys were doi uh doing, right. But not uh not obvi every detail obviously. Yeah. What yeah, you don't know why why you are doing it anymore. Yeah. Oh boy. Yeah. Oops. Okay, I think my mic is falling falling. So I think we go, huh? Thank you very much. Thanks a lot.
Yep. S so Joel is going to talk. That's mine. Well, may from different point of view. But Yeah. Right. Let me just tell us so what you know already about uh, I don't know, how far you are with the s features. Or So you know s something like that. Mm-hmm. But t what Christos was doing. Right. Yeah. Mm-hmm. Mm-hmm. Mm-hmm. Yeah. What yeah, yeah. We shouldn't smile. Right. Yeah. Yeah, but I'm using that for different different reason, right. So it's uh speech coding. So uh what is the task is usual just take the speech of course somehow to encode it into as as small number of beat as possible and with a reasonable quality of course. And again to e uh decode it. And I'm using F_D_L_P_ for that, still playing with uh more less one second long segments. So just dividing the speech into one s one second frames doing D_C_T_. It's just the standard wor of work. And then doing some compression and computation of yeah, yeah yeah, sh oh, yeah. But you are doing the same, no? Yeah b yeah yeah you take the the D_C_T_ and you just divide it into several sub-bands, frequency bands with a different uh I d I do. You mean No, no, no. No. I don't do. Yeah, just one second with without overlapping. But in frequency domain there is a overlap. Because you can choose which filters you want, more less, or just trajectories of those bands. So there is overlapping. I'm using those Gaussian filters or classical those those TRAPet trapezoidal like in uh M_F_C_C_s, right. And then processing each f yeah, P_L_P_s. Okay, tr triangulars. That one. Then I'm doing s doing uh independent processing of each frequency band. So I'm computing that Hilbert envelope, which is just the envelope of some temporal trajectory. And more or less you can divide this into a Hilbert envelope, Hilb Hilbert carrier. We use something like L_P_, a linear prediction envelope. Auto-regressive model and excitation signal. Yeah, so those are two parts or items you need somehow to encode if you want again to get original speech or reasonable quality. And yeah, there is some compression factor, which is again when you're computing that auto-regressive model from F_D_L_P_. You can compress the spectrum somehow in in such a way that you rather um preserve those even also those dips, not only peaks in the spectrum. And it sounds better for some kind of speech signals. So it's kind of logarithm when you use logarithm for M_F_C_C_s, right. So it's again compression factor, something like th and but then you have to uncompress it again of course to get back the speech. And there other other things which I'm doing. But it's not related to speech recognition so much, or but Well Right, right. Well, it's yeah. Sure, sure, I know. I mean it's more or less the same approach, right. There is m uh just more right, right, right. Yeah. With what? Yeah yeah. Sure. Number of bands and You know, that's the problem for speech coding. Because I don't have results like in speech recognition. So of course I have to listen it and and s well so now what I'm using are fifteen bands, which seems to me fit fifteen frequency band, which seems to me quite reasonable. We can go down till like twelve and still okay I would say. But then you start hearing difference. Or also that's another problem. Once you can go down with frequency bands, then you will have much uh more c uh complicated data. Uh let's say excitation signal, or that carrier which yeah because broad bands, right. Yeah, maybe like that. No? So there is more and more s uh frequency components let's say or something like that in one broad band. So No. L_P_C_ will maybe. But then you have still another part, sometime something like a source and a filter. So when you are doing speech coding or Yeah. Why? I don't know now. It's quite long. Well I mean it's it's the same like when you are doing L_P_ L_P_ synthesis, right. So you've got source signal. I don't know. Source. Uh Yeah. And then there is some filter, which is that one divided by A_ Z_. And that's and that's it. And the source can be a noise signal or it can be some uh sequence of uh what is that? Pulses, right. This is yeah. So and it's more less or less the same what I'm doing. So this is some envelope, which is in our case related to Hilbert envelope. And this is a source signal which is Hilbert carrier. Right. Right. Exactly. Yeah. So this is now going to be frequency. Yeah. So more or less if we uh can we can plot it. So what we can see is some Hilbert envelope like this. And then there is that Hilbert carrier. Or this not Hilbert carrier. But this is the signal which we get from one di one frequency band. So you compute D_C_T_, right, from signal? Yeah yeah, exactly. And what we get from that is this Hilbert envelope, right. And the second, which should be that carrier should be something like this. It's just really close to cosine we m we might see if we do it do it properly, right. And this is what you also have to somehow transmit, or at least try to transmit in order to get Right, right. Right. No, I don't need L_P_C_. No, no. This is a residual for me. Yeah, I'm just wan I just want to transmit this signal right. It's and what you need is this envelope and this one carrier. And you again get the same signal, this this this shape, doing I_D_C_T_. Putting everything together. Right. But if you take a look for example for a s if you have a very g good music signal, which is very uh periodic you can really see something like there is one peak and And frequency more or less doesn't change. It's uh related to uh that frequency of the broad band or how it's called. If you add those filters, which are uh wider and wider this is that centre frequency which is somehow related to this one. So then you can transmit uh just fe It's worse and worse. Don't then you don't see any anything like cosine. But Right, right. Everywhere. It sounds like L_P_C_ when you excited by noise. It's better, I would say. Uh better quality. Definitely better quality. It lower bit rate. But that's a problem. So so ... Yeah, sure. Objective measure, well, it was some those uh distances, spectral distances well, Edinburgh saw probably. So well, I mean I h Alright. Yeah. Sure. I see. Right. Yeah. Yeah, yeah. Also there are papers about it that f if you have like twelve frequency bands it's somehow optimal. Compression is only to really to somehow feed that uh Hilbert envelope to yeah. Yeah, yeah. You can model it better. There is no echo or something like reve reverberance in the signal if you really go down with that compression factor to like zero point one, something that. Zero point one. So Right. It's it's s right. And it's still yeah. Yeah, you just preserve something different little bit. Let's say in R_S_S_. Yeah. I think Hynek has got ma many papers or some papers over that. So Oh. In I think I read it. Right, more or less. Right. Exactly. Yeah. Well I think you can really take those papers from uh Marius about F_D_L_P_. They are that's really enough on What is SAPA? Mm-hmm. Mm-hmm. Just something. Right. Approaches. Mm-hmm. But there are also reports from IDIAP I think. At least two reports from Marius seem to me. Yeah, they are more less they are same like. Mm. Mm-hmm. Yeah, yeah. Right, right. Yeah, yeah. Yeah. Yeah, definitely. Sometimes unreadable I would say. It's quite difficult, but yeah. Mm-hmm. Mm. Mm-hmm. Yeah, I think this year a lot uh Interspeech there was one paper. But it was regarded to speech recognition somehow. Y yeah, yeah. It was not something like speech coding or that D_C_T_ transforms. N nothing like that. So Right. Right. Mm. Maybe yes. Maybe no. I don't it's hard to say. Now it's not so bad with that if you really write it f well. And he did. I think so. Sure. R right. Yeah. Yeah. Yeah. Yeah. Whatever. That's not important. Uh .. Oh yeah. Sure, it is important for this case. But still Right. Yeah, and definite. November? November. What are you going to t talk or to teach? It's just that perceptual stuff and Mm-hmm. Do you already know how many students you have? No. Oh it's graduate for graduates. Well I think Guillermo was thinking about. And Hmm. Well I would like to go there, but not just for credits. Just maybe to hear something new. Mm-hmm. Right. Mm-hmm. Oh, sure. A any I think so. Why you are you doing this. Good question sometime asking Right. Thank you.
No I'm going to listen. Yeah. Yeah. Yeah. That's true. Yeah. Speech coding. No. Yeah. Yeah well that's uh yeah. So Uh well uh so basically I started reading about the features that's taking the long temporal and uh time frame, that's uh TRAPs. And uh tandem approach. Yeah, s yeah, yeah. So then I also started with the there was a paper on uh fitting a ear model to the D_C_T_ of a signal. Started with that and slowly getting used to the stuff. S Yeah. Uh I think yeah, yeah. It's uh the paper by Dan Ellis and co. Yeah. Yeah. Yeah. A_S_R_U_. Two thousand four I guess, yeah. Two thousand three, yeah. Yeah. Okay. Yeah, yeah. So for Yeah.. Mm-hmm. Yeah. Mm-hmm. Okay. Mm-hmm. Yeah. Yeah. Yeah. Yeah. So then I continued with uh another uh paper that you are mentioning, L_P_ TRAP. So I just moving around with that ideas. And also uh the main idea today why I told maybe is like if you can list down what all things are there to be taken care of. Then one by one. Because finally face find I do have something else. So uh maybe to list down the things other than the what I've already taken from you. And also to discuss generally. Yeah. Yeah. Feature extraction. Part tandem training. 'Kay.. So yeah, right now I have the feature extraction part where I can get the phoneme posteriors and through the neural net. So then I think I need yeah. Yeah, i yeah, but uh yeah, it's T_K_ through for some other task I've run it's T_K_. So I have the general feel. But uh maybe I can take the interceptor from you and run it once on my to have it. Yeah. So Yeah. Basically. So then I can can share what I ca have read and uh Yeah. Yes. So I'll use this. Yeah, the Let's do the feature extraction that we have already taken, then the back-end for that. Then? Yeah. Yeah. Okay. Uh so yeah.. Yeah. Yeah. Yeah. Traps I know. But though with various? Okay. Length. Ah. Okay. Frame. Okay. Filtering, yeah. Mm-hmm. Okay. Yeah. Mm-hmm. Mm-hmm. Okay.. Okay. Okay. Okay. Okay. Uh-huh. Mm-hmm. Okay. Okay. It's alright. Yeah. Yeah, that's ri Yeah. So compression basically you are looking at I guess, yeah. Mm-hmm. Coding.. Yeah. Okay. Uh-huh. Uh-huh. Okay. Uh-huh. Okay. Mm-hmm. No, that's for compression. I don't need for for compression you basically don't need the. Yeah. Mm-hmm. Mm-hmm. And P_L_P_. Triangle. Uh yeah. Mm-hmm. Okay. The speech signal, yeah. Mm-hmm. Mm-hmm. Okay. Mm-hmm. Mm-hmm. Mm-hmm. Mm-hmm. Okay. Mm-hmm. Yeah, yeah. Yeah. Yeah. Mm-hmm. Mm-hmm. Yeah. Yeah. Because Yeah, use maybe you can use the board for that, yeah. Can we remove this when you are going to this place? Oh. Periodic pulse, yeah. But this is uh Yeah. This is like a residual for him. Yeah, in the for uh frequency it is. For any object you measure you use for uh a reconstructed signal and uh Uh-huh. Yeah. H_P_. So what is the compression factor we are getting here? Uh-huh. Okay. Yes.. Mm-hmm. To fit the the Yeah. Mm-hmm. Yeah. Mm. Coming back though. Yes. Yes. Envelope, yeah. Yeah. Uh one thing I wanted to know is any um a good literature on these lines. Uh where you started with, yeah. Yeah, that's one yeah, o uh th oh. I have two with the Interspeech paper I Pattern? Okay, okay. Uh-huh. Okay. Yeah. Okay. Uh-huh. Uh-huh. Mm-hmm. Okay. Okay. Mm-hmm. Okay. Mm-hmm. Mm-hmm. Okay. Uh-huh. Uh-huh. Uh-huh. Yeah. So Yeah. Yeah. Mm-hmm. Mm-hmm. Yes. Yeah. Yeah. Yeah. So Yes. Uh yeah. Well, regarding this F_D_L_P_. Yeah, He doesn't send. Yeah. Yes. Yes. So Yeah.. November? Okay. Yes.. Yeah. Bu three I mean four credits? Or uh a total of four. Ah. Yeah. Because s s in I_S_ it's four uh credits and uh Guillermo is taking I think he's g yeah. Notebooks. Mm-hmm. Where was this? In okay. Yeah. But I can come to yeah. Later. Okay. Uh-huh. Yeah. So in case I need something on this F_D_L_P_, I'll come to you? Uh yeah? Okay. Okay. Takes your uh Uh-huh. Yeah. Yeah. Mm-hmm. Yes. Yeah, thanks.
Oh what what is the topic here? Well, I guess so. I couldn't. I would like to change. But I cannot. Well I think that I'm more now. Much more than me. Because Oh. Okay. Yeah. To the D_C_D_. Oh I cannot say it. To the Hilbert envelope you mean, no? Yeah, yeah. Yeah, okay. Yeah, yeah, yeah. Why that? I don't think so. Yeah, I just re-wrote in in C_ plus plus. I didn't do us so much on that. Well, we we tried uh many knobs to tune on th on that. Because you can decide what compression factor and everything. So I tried a little bit like um pick maybe three knobs and try to go through the three D_ three D_ cage to see what happens in which which kind of settings. So this we did. But we we didn't because there was still a question that I don't remember right now. Because there was some essential question I I don't remember, I would have to look at it f and because be before I wou before I we could u we could write anything I I would be have to be sure that something is right there. I couldn't be sure about it at the time. So and anyway, it it takes time, this this computation. It's it's very inefficient. It takes like hundred times more than computing some M_F_C_C_s computing these features, because it's uh huge D_C_T_, huge F_F_T_ everywhere. And Yeah. There is no optimisation. We just stick to the TRAP. We wanted to make as close to TRAP as is possible, which is silly, but that w you have to start somewhere. Yeah.. Y okay. So so what well, last time we we were s talking about what, about Gaussians? You you have the Gaussians running somehow. Like feature extraction. And then the tandem training c also, so the you are able to train the net now. No. No so they are going from ICSI? Huh. Yeah. Eh? But who who is the one who who is now developing the Quicknet and these things? The the M_L_P_ four and these things. Because it's It was Barry? Ah, I see. I see. Well still we have the experts who who made the software on here, you know. Olivier Bornet and Uh Quicknet training. Yeah. Yes, but Olivier make it run on on these machines and make the uh cross the two G_ barrier here at IDIAP. Because we couldn't we couldn't do something, I don't know what. But he is he is no good in Quicknet, right, I s I guess. Yeah. Um okay. So so? Right. So you just miss the back-end, which is easy because it's H_T_K_, it's everyone knows here. Okay. Yeah, okay. A and you you just need from us to hear w what are the other p front-ends and other features to to choose or something. Okay, so how do we do it? Shou shou should I list it? Or Yeah. Well, uh like I I d I distinguish between feature extraction and then the M_L_P_. Because see you feed something to the M_L_P_ which are not the waves. So so well and that's that's what I call it. And then the back-end. But okay. Well so what what I have here, what I've been working on since I was here was M_F_C_C_ P_L_P_, which is always needed for compressing purposes. Then TRAPs with w various lengths. And also uh you know TRAPs now? With var various lengths we were trying to to start them from from on the ultimate length of one one uh frame up to one second or even more. But it was mm so this. Also w then we apply the D_C_T_ on the TRAP itself, which is some something like the Gaussians. Applying some filt filtering. Basically the TRAP. So this. But this is mainly Franta who was doing this. So this is related to TRAPs. I was also trying the warping. Means if you have a second um which is uh a hundred and one samples, I was just keeping the central ten ten samples. And then I was throwing away some some of them, you know. Like with sampling this this uh one second of the TRAP to something which I which I uh preserved all the points in the centre and just uh picked every tenth point at the boundaries, which means the TRAP uh hundred and one TRAP trajectory shrank to say fifteen points with uh with still preserving the maximum resolution at the centre and throwing away most of the points near the boundary. So this I was trying. So these I have some results on. So these are TRAPs. Then the Gauss TRAPs. Then tandem system. Uh the tandem you plus minus half. Because that's that's just a P_L_P_s. And then there is a training uh with M_L_P_ that's taking some context of these P_L_P_ coefficients. So it's like feeding the P_L_ uh P_L_P_s or M_F_C_s, w whatever to M_L_P_ and taking some context. This is the tandem. So this set-up we also have here. I didn't experiment with this. But maybe you some more results oh it's Hemant who has the results on tandem I guess, no? Yeah, yeah. Right. Yeah. But uh I think the uh I think the most important results are already in the thesis he's writing. So so is there some somewhere. So this is tandem. Uh then uh then what? And then there is the F_D_L_P_. Then we can talk more about. Which is which I have more experiments on. Also this warping and compressions and all these things. So Well it's bil still sub-bands, no? Well yeah. But that you just didn't talk about any sub-band so far. And by the way, you don't do the overlap on the sub-bands, no? Or you do? Yeah, but you don't do the overlap on the time domain? You said somethi yeah. Okay, so that's the difference. Hmm. P_L_P_s. M_F_C_C_ or triangles? Okay. So you know about this compression, how how does it why it's okay. I just don't know. Ah he's new. He's he's doing you most of the things. So Okay. Well what? I what? I I d yeah. Well the beginning of the Oh yeah. Sure, sure, sure, sure. Okay. Yeah. Okay. So um just they in th um in the speech uh re-synthesis or coding, what you're doing, um did you try to to play with the knobs somewhere. Like uh many uh uh v widths? I m no no. The compression number of bands and lengths of the so what you what you were playing with and what other result? Like what to use so that it sounds the best? Yeah, o of course you you listen to it yourself. I d I'm fine with that. Because Yeah. Okay. But If you go down? I see. So so you won't say that the L_P_C_ is not able to fit it so that the residual was easy? Will be hero. I don't think so. No, no, no, no. are not supposed I think you have to push it push it as much as possible because it reacts to the Yeah. Yeah. Oh which lines? Yeah. Okay, okay, okay, okay, okay. Mm-hmm. I see what you mean. This is the sub-band time signal. It's Yeah. I see. Okay. So Mm-hmm. I see. I was thinking the residual in L_P_C_ fit of the Hilbert envelope, not this. I was not thinking of this at all before, you know. 'Cause there are three things. First you have to encode this, then the L_P_C_ coefficients and the residual after L_P_C_ fit if you like. Oh you don't This is the residual? Yeah. Well i Okay. Yeah. Yeah. Yeah, right. Mm-hmm. Yeah. That that's nice. Okay. And so so so the so the broader bands do what on this picture? L left left. Hmm. Yeah, that that's Yeah.. Although Herve just said that we shouldn't s talk about confidential things here. That's that's the problem. Yeah. Okay. Yeah. Oh okay. So so there is a compromise between the the lower bands the the worse stuff you just said and and the more bands the more coefficients to transmit. So fifteen is about okay. Okay. So this. And then there is a compression. What about that? So that you get so that you can't model it actually. Yeah. Of what? Oh oh oh, compression. 'Kay. The sub-band signal. Sub-bands power spectrum. L_P_C_. L_ uh L_P_ Yeah. S Well yeah. Yeah, yeah. Go on. Uh Yeah, they are. About three papers. There was uh the s s s SAPA or how it's called, the confere then then And one is in in the America uh yes. Yeah. I I don't know. I just n You were You don't know. Evaluation, that's that's bad. 'Kay. Yeah. Also I can um before Marius left, he he has he had actually left himself. But all the files are here. You know. So all the m MATLAB implementations of these are here. And there is also one big table that he used he used to yeah. He was also playing with all these compression factors and lengths of the of of that. And there is a big table in in Excel which you can look at. Yeah yeah yeah. This wa Uh yeah. Because it it it took a huge amount of time. And he basically he he was the first one to realise that these these guys are pretty slow to to do these things. So he he he just uh he just uh put here the P_ make environment and he didn't stick to any cluster or anything. He just put P_ make and run it on uh on all the machines here. So at some point he was using all the available memory by any mistake, you know. So at th in the morning all the machines were were just stuck. D no one could ever m move the mouse. So these these resul yeah. Because he was doing everything in MATLAB. So it was again ten times more demanding than C_ plus plus and then Mm But he did. He wrote it well. But anyway anyway. Because I w i in the code that I wrote I was like playing also with if if the mat matrix is parse I really don't use it all. I just multiply from here to here from here to here. Not to matrix p my matrix. And then I I just optimised it a bit. And also BLAST libraries and all these sta things. But okay. Oh well whatever. But the t Uh Yeah. Yeah. But there is a the back-end recognizer which I will give you which is turn-around of eighteen minutes or well from eleven to fifteen minutes. Normally it would take hour and a half. Yeah no. No today it was over one hour. It's just like ten times less maybe or five to ten times less. But still still you can you can five times more compute. Great. I have to talk to you about these. I I don't know. I I didn't try to. I didn't even mail to to Chernotsky, I must say. Uh I just talked to one girl there which told me that he would be possibly able to find me a job there. Any any job, like related maybe to field. But job. Not not anything any research. So if if nothing then then I could be able to get the job somewhere. Yeah, probably. Yeah, but t s but yeah. 'Kay. We hope that you find out that everything was wrong and that the potential is much bigger. Yeah, it actually work that too. No. But after you find out the back then it will be that will be the yeah. So we will send you the papers related. And I will point you to all the code which I have. And yes. Yeah, yeah, yeah. Yeah, I can play it back. So I can just now tell the the locations of all the files and you can rewind. Yeah, yeah, ferret. Um so Any any more things? Yeah, until twenty four. It's the Friday. So November, yeah. Mm.. Mm-hmm. It was yeah, the last time it was for me it was good, the course from my from one special reason, because in the f when there th were the flats in I lost all my papers from my school. All the all the exercise books. Um how to say this? Note note yeah, all the notepads and everything, all the notes from any courses. So I was remembering what I was learned or what I was taught at the time. So so I have it now back, at least partially. 'Cause all the all the algorithms you were speaking about Yeah. Uh uh I would add Yeah. Now. Mm-hmm. Mm. Oh. some Yeah. Yeah. Yeah and also in the in the l lab labs the d people didn't know anything about it and they just follow the papers. It was Yeah. Uh Not to this topic. Yeah, but anyway after I'm back I'll be still on Skype and all these things, so no problem to contact me until, I don't know, May next year. So it's fine. Well I would like to, yeah. I should. But I would have to study here for that that long time, no? I th Two two years, yeah. And I would have to stay one more. Uh well it would be good reason for Petra that I have to stay here.. Yeah, yeah. Because she has a good position because she has a friend which is starting who is starting a lab there. And everything is it's a great opportunity for her at least. But the problem is and then she will go back and there's uh f family issues and all these things, you know. For a girl it's always the pain. Because she has to break for some time, at least a little. But has to break. And in in the science it's not that easy to to break. Okay. Yeah. Yeah. But Yeah, but I think definitely you are the best person here to with uh related to availability of you. Because if if if I want to go to Herve, I probably call secretary. He's there or and when I can and i there was no op uh no ev every time I came to you you were available. At least it That's great. You know the core, which we don't remember anymore programming. We know the details and you know what what is th what is the global stuff. Uh ask Hynek. Ask my boss. Yeah. Yeah. Oh.
